Event by Event Analysis and Entropy of Multiparticle Systems 
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1. Entropy, being one of the most important characteristics of a system with many degrees of freedom, is — in 
particular — an important characteristics of multiparticle production processes. In this context it abounds in analyses 
of dense hadronic matter and in discussions of various models of quark-gluon plasma |Q| . 

\ Processes in which particles are produced can be considered as the so called dynamical systems |^,^ in which 
— generally — entropy gets produced. Although application of the mathematical theory of dynamical systems to 
[ calculate the entropy in multiparticle production is still out of reach, the existing models suggest that the systems 
produced in high-energy collisions pass through a stage of (approximate) local statistical equilibrium Q . 
' Recently Q we have proposed to apply the event coincidence method Q to measure entropy of a multiparticle 

I , systems, provided it can be described by a microcanonical ensemble]^. Since the event-by-event analysis becomes a 
^ ■ commonly accepted tool to study the multiparticle phenomena, we feel that it is worth to pursue this problem further. 
^\ \ In the present paper we extend the coincidence method to the more realistic case when the energy of the system in 
question is not necessarily fixed. We show that the method can be rather effective investigating local properties of 
^ the particle spectra. Since the observed particles map the state of the system just before it breaks into freely-moving 
O^ . hadrons (which get registered in the detectors), such a measurement can provide an important information on the 
evolution of the systemg. 

At this point it may be important to stress that to estimate properly the entropy of a multiparticle system one 
would need information not only on distribution of momenta but also about positions of particles. In particular, 
correlations between positions and momenta are very essential. This information cannot be obtained, generally, in a 
model-independent way. One should thus keep in mind that the entropy we discuss in the present paper reflects only 
O i' partially the statistical properties of the system: the degrees of freedom related to positions of particles are integrated 
over. Nevertheless it provides a valuable information about the system in question, and can be used to identify its 
nature. In particular, our method may have a wide range of application for the systems where correlations between 
momenta and positions of the particles are unimportant. 

2. In a system at equilibrium with all states having the same probability (microcanonical ensemble) entropy 
measures the number T of states of the system: 



S = \ogV. (1) 

This formula can be rewritten in terms of the probability p for one of the states of the system to realize. Since all 
states have equal probabilities we have 

P^f (2) 

and thus 

S^-\ogp. (3) 

Ma observed |^ that the probability p can also be expressed as probability of "coincidence" , i.e. probability that while 
sampling the system, one finds two states (configurations) which are identical to each other. Indeed, this probability 
is given by 



direct measurement of entropy of multiplicity distribution observed in multiparticle production was first reported in 
^Note that the free movement of particle from production point to the detector does not influence this measurement. 



1 



C2= Yl ip') = ^p'=p 

all states 
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so that 
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log C2 



Now, if we measure N configurations and find N2 coincidences we have (in the hmit of large N) 



C2 



No 



N{N ~ l)/2 



(5) 



(6) 



and thus we obtain a method of estimating p and therefore also of entropy S. The attractive feature of this proce- 
dure is that, as seen from (^), the statistical error drops very fast (like N~^) with increasing number of the tried 
configurations!^. 

This method does not work, however, if the energy of the considered system is not precisely fixed (e.g. for canonical 
or grand-canonical ensemble) or if the system is not in termodynamic equilibrium. In such a case the states of the 
system have, in general, various probabilities of occurence. Consequently, neither (^ nor (^ are valid. 

In the present note we argue that even in this general case the coincidence method can nevertheless be used to 
obtain information on the entropy of the system. To this end it is, however, necessary to measure concidences of more 
than two configurations. The argument goes as follows. 

For an arbitrary system entropy is defined by the general formula j^] 
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all n 



Pn l0gp„. 
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where pn is the probability of occurence of the state labelled by n, and the sum runs over all states of the system. 
To begin we observe that (0) can be rewritten as 



S 



(logp) 



where < ... > denotes the average over all states of the system. 
Using now the identity 



p =< p > 
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(9) 



one can transform m) into 



5 = -log<p> + Vl/fl-^- 

rn—2 ^ ^ 



(10) 



In this way we have expressed the entropy by the moments < > . 

Now, the point is that these moments have a simple physical interpretation in terms of the coincidence probability. 
Indeed, let us denote by Ck the probability of coincidence of k configurations. In terms of probabilities p„ it can be 
expressed as ^: 



(11) 



all 1 



all : 



We see that the probability of coincidence of k configurations is given by the k — 1-th moment of p. 
We thus conclude from (|l^) that the probabilities Ck of coincidences of all orders are in principle necessary to 
determine the entropy of the system. 
In terms of C^s, (1^) reads 



^ This holds for A'^ in the region y/T <^ N T, the case of interest in the present context. 

■^This formula can be easily proven by considering the Bernoulli distribution of A'" independent samplings of the considered 
system. The error can be estimated with the same technique. 
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S=-logC.+ (12) 



ra=2 k=0 \ / \ ^/ 

If all states have the same probability of occurence we obtain trivially Ck+i — Thus all terms in the sum 

vanish and we fall back to the formula (||)^. 

Of course the series ( [T^ ) and its approximations may be used for estimation of entropy only if the result is convergent. 
To this end the consecutive terms must be small enough and thus the parameters Ck+i/{C2)'^ cannot be much larger 
than one|^ This condition limits seriously the applicability of (p^). 

3. It is useful to rearrange the series ( p^ using the so-called replica method To this end, let us consider a 

system made of M independent replicas of the considered system. The entropy of such a composite system is obviously 
given by 

S{M) = MS. (13) 
On the other hand, since it is made of M independent subsystems the coincidence probabilities are given by 

Ck{M) = [Ckf'. (14) 
Consequently, repeating the argument of the previous section we obtain 



TO ' \k J V (C2)* 

m=2 k=0 ^ ' ^' 



M 

(15) 



Now, consistency of (IT^ ) and (|l5| ) requires that the sum on the R.H.S. of (15) is proportional to M and thus only the 
term proportional to M can survive. This term is easy to calculate by observing that 

^'+'^'' = l + Mlogfg^U.... (16) 



(C2)V ^\{C2) 



Substituting this into (|I5D we obtain 

00 m , 

k) ^\[C: 



SiM) = -Mlo,C, + M± ^ti-iri":) log fg±i) . (17) 



TO 

m=2 k=0 



Using (|13D we thus have 



o.c,+f If (-!)'(:) log (!||i) 

m=2 k=2 ^ ^ \L / 



(18) 



which represents our final formula. It is providing partial resummation of the powers of Ck+i/[C2]'^ into logarithms. 
4. The formula ( p^ ) can be rewritten in terms of the Renyi entropies]] defined as ||ll| 

Hk--'^. (19) 
Using this definition one can easily see that Hi = 5". Substituting ( p^ ) into ( p^ ) we obtain after some algebra 

71=1 k=0 ^ ^ ri=0 k=0 ^ ^ 

= H2 + (H2 - Hi) + {H2 - 3i/3 + Hi) + {H2 - 3H3 + 3^4 - H5) + ... . (20) 



'Eto(-i)'(T) = (i-ir = o- 

It is not difficult to see that Cfc+i/(C2)* > 1. Indeed, for any positive variable / we have < ^(/— < / >)^ >> 0.. It 
follows that < /''+^ > - < / >''+^> 3 < / >^ (< /*"^ > - < / >''"^) and one can complete the proof by induction. 
''The argument presented in this section was suggested to us by K.Zyczkowski. 
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One sees that the first N terms of this series represent the polynomial extrapolation of the function Hk from the 
points fc = 2, 3, 4, .., + 1 to fc = 1. This observation not only explains the meaning of formulae (^8|) and (]20| ) but also 
suggests the way to improve it: one should look for more effective extrapolations. One possibility we have investigated 
in some detail is to take 

Hk - + ao + ai(fc - 1) + a2(fc - 1)^ + ... . (21) 

Number of terms is determined by the number of coincidence probabilities one is able to measure. If only C'2 and C3 
are measured we obtain 

S^H^ + ""—^^ (H2~H3). (22) 

- log 2- (1/2) log 3^ 2 31 \ I 

If three coincidences are measured we have 

S^H2 + {H2~ H^){1 + w) - u;(i/3 - i?4), (23) 



where 



LO = 



l-21og2+(l/2)log3 
log(2/3) + (2/3)log2 • 



(24) 



In Figure 1 the results of this procedure are shown for three distributions, often encountered in the analysis of 
multiparticle data: Poisson, Negative Binomial and the Geometric series. One sees that extrapolation using only two 
terms is by far sufficient to obtain an accurate value of entropy, provided the average multiplicity is not lower than 
1/2. The first term {H2) is, however, hardly sufhcient even for fairly large multiplicities. 

For n — > the extrapolation is rather poor which shows that the method is not well adapted for studies of low 
multiplicity events. 

We have also found that for these three distributions the polynomial extrapolation ( pO| ) less accurate than (pi]). 

5. We have suggested recently Q that the coincidence method of Ma can be used to estimate the entropy of the 
system of particles produced in a high-energy collision. The idea was to consider the produced events as the randomly 
chosen configurations of the system. Measurement of the (appropriately defined) probability of coincidence of two 
events was interpreted, following the formula (^), as a measurement of entropy of the system^ 

As it is not very likely that the system produced in a high-energy collision can be indeed accurately represented 
by a microcanonical ensemble at equilibrium ^, however, one may have justified doubts about the accuracy of this 
method. It is clear from the previous argument that the Eqs.dl^) and ( pO| ) provide a possibility to assess this. Indeed, 
already measuring the probability of coincidence of three events 

" N{N -1){N -2)/& ^^^^ 

allows one to estimate the first correction to the Eq.(^. As discussed in the previous section, this is often sufficient 
to obtain an accurate value of the entropy. 

6. Application of the coincidence method, as described in previous sections, for measurements of entropy in 
multiparticle production (which is our main objective) requires discretization of the observed multiparticle spectra 

The dependence of the results of measurements on discretization can be discussed as follows. 
Consider a system consisting of a certain number, say iV, of particles produced in a high-energy collision. Let 
<^{q)dq = ^{qi, ..qN)dqi...dqN be their probability distribution in momentum space. To discretize, we split the 
distribution into M (3N dimensional) bins of size Aq^ , m — 1, M. The probability distribution to find the system 
in the bin m is 

w{m,M) = $(g(i)(TO),...,g(^)(TO))Aq™, (26) 



®As explained in Section 1, we are considering only entropy related to the distribution of particle momenta. The volume 
fluctuations and correlations between the position and momentum of a particle are neglected. 
^Although this is the case in the Fermi model of multiparticle production pi. 
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where \q^^^ (m), ...,q^-'^^ (rn))] is the set of TV momenta defining the bin m. The coincidence probabihties measured 
from the distribution (Eq) are 



M 



Cfc(M) = ^(Ag„)'=[ci>(gW(m))]^ (27) 



If we now spht each bin into A new bins (and thus multiply the number of bins by factor A) the probability (|2^ 
changes accordingly and we obtain 

, A/ A ^ 

Cfc(AM) = ^ ^(Ag„)^ J2 A [HQ^'\^^lm).-,q^''HmJm))] ■ (28) 

m=l im = l 

For non-singular distribution <E>(<7i, ^at) the dependence of the sum on the R.H.S on A disappears in the limit A — > oo 
and thus using ( p^ ) or (|2^) we have 

S'(AM) =logA + 5(M) (29) 

which summarizes the dependence of the proposed measurement on the resolution used in the procedure of discretiza- 
tion!^ Note that A denotes the number of splittings in 3N dimensional momentum space. If the splitting procedure 
is performed by simply splitting the bins in one-dimensional single particle momentum distribution into Aq new bins, 
we have A — (Aq)"^^ which gives 

S{XM) ^ 3N log Ao + SiM). (30) 

The final question one may ask is how the entropy measured from the distribution ( p6| ) is related to the "true" 
entropyp] of the N particle system described by the distribution function ^{qi, qjy). To consider this problem we 
observe that the spacing between the momentum states of a system of N particles is given by the quantum- mechanical 
relation 

(27r^3^A^ 



where v denotes the volume of the systerrj^. Denoting the total number of states of the system by T the "true" 
entropy is given by 

r 

5(r) = -^p(<z(^n*),-,'?^'^n*))iog[p('7(')(*),...,9(^)(*))] 

1=1 

M 

= -J2 w{q^^'^ (to), <?(^) (m)) log[w((7(i) (m), q^^^ (m))/r(m)] 

m—l 

M 

= S{M) + ^ w{q^^\m), ...,q^^\m))\ogV{m), (32) 



where 



r(TO) 



5q 



(MM )) 
(2^) 



3 



N 



(33) 



Additional dependence on A would indicate that the distribution $(gi, ...,g]v) is singular (see,e.g., 

We use quotation mark to emphasize that, as explained in Section 1, the entropy we discuss in this paper is not -in general- 
the actual entropy of the system since it neglects the positions of particles in configuration space. 

The fluctuations of the volume can be -at least in principle- determined if the HBT correlations are measured for each 
event. 
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is the number of states in the bin m. Here Aq{M) denotes the size of the (l-dimensional) bin in momentum space of 
one particle. 

Eq. ( ^2| ) relates the entropy S{T) of the considered system to S{M) - the one measured by discretization into M 
bins. For the simplest case when all bins used in discretization are equal to each other, Tm does not depend on m 
and the last sum in ( |3^ ) can be performed. The result is 

S{r) ^ S{M) + log(r(m)) = S{M) + SiVlog (^v'/^^^^) • (34) 

7. To assess the practical possibilities of using the proposed method to the actual multiparticle data, we have 
estimated the coincidence probabilities for a system of particles produced independently. 
Suppose that the produced particles come in a number of species, labelled by /. Then 

Ck^l[Ck{f), (35) 
/ 

so that it is enough to consider one kind of particles. 

We now discretize the system by splitting it into M bins of size Aq. With this procedure, the state of the system 
is defined by giving the number of particles in each bin. If particles are emitted intependently, the probability of a 
given state is 

M 

W(ni, ....,7171/) = ]JP(ni,ni), (36) 

i=l 

where P{n, n) is the Poisson distribution with average n and is the average number of particles in a bin labelled 
by i given by 

fit^ I dqp{q), (37) 

Jqi-Aq/2 

where p(n) is the single particle momentum distribution: J dqp{q) ~ N with N being the total number of particles. 
From ( |27| ) we deduce 

M 

Ck= J2 [W{m,...,nMt =l[cr'{n^), (38) 

ni,...,nM i=l 

where 

Cr'{n) = Y.[P{nMK (39) 

n 

We have calculated numerically C^°"^(n) for 2 < fc < 5. They are shown in Fig. 2, plotted versus n. One sees that 
in the range 1 < n < 50 they can be well approximated by the formula 

which shows that they are not prohibitively small even at fairly high multiplicities. We thus conclude that for one 
bin at least C2 and C3 should be possible to measure with a reasonable accuracy even for large systems (i.e. systems 
containing many particle . 

The situation becomes much worse, however, with the increasing number of bins, as easily seen from (|38|). For 
N ~ 100 and M = 10 bins, for example, one obtains C2 ~ 10~^'^ and C3 « 10""'^^. The situation improves somewhat 
for smaller multiplicities: iV = 10 and M — 10 one has C2 ~ 10^^ and C3 ~ 10^ As shown in Section 6, however. 



'For large multiplicities the first term in the asymptotic expansion of Ck is l/(\/2fc7rn) 
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the method does not work if the particle multiphcity in one bin falls below n ~ 1/2. Therefore it is limited to study 
of rather small regions of phase-space. 

8. At this point it may be worth to point out that the measurement of event coincidence probabilities represents an 
interesting information about the multiparticle system, independently of its relation to the Shannon entropy. Indeed, 
it gives a valuable information on statistical fluctuations of the system in question and thus may be considered as 
alternative approach to the problem of "erraticity" It seems to be a more detailed measure of even- by-event 
fluctuations than the distribution of the (horizontally averaged) factorial moments |Q. The weak point is that the 
method seems applicable only to a small part of the available phase-space|^. Some averaging procedure may thus 
turn out necessary also in this case. 

It is also worth to emphasize that the event coincidence probabilities are sensitive to entirely different region of 
multiparticle spectrum than the widely used factorial moments | p^ . Indeed, whereas factorial moments are sensitive 
mostly to the large multiplicity tail of the spectrum, the coincidence probabilities obtain largest contributions from 
the region where the probability distribution is maximal. The two methods seem thus complementary to each other 
and should best be used in parallel to obtain maximum of information. 

9. In conclusion, we have proposed a generalization of Ma's coincidence method of entropy determination. It 
requires measurements of coincidences of 2,3, ... configurations. The new method can be applied to a more general 
class of systems. In particular, thermodynamical equilibrium is not necessary. 

The method seems well adapted to analysis of local properties of multiparticle states produced in high-energy 
collisions. It may thus turn out useful for investigation of the thermodynamic properties of the dense hadronic matter 
and/or quark-gluon plasma. 
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FIG. 1. Estimates of entropy for systems with commonly encountered distributions usiiig the extrapolation given by Eq.(^l|), 
plotted versus average multiplicity. Continuous lines: entropy calculated directly from (X^. Dashed lines: entropy calculated 
from Open points: Three-term extrapolation (jisj). Full points: Two-term extrapolation (p^. 
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